Production and use of modified cystatins

ABSTRACT

Cystatins that have been modified by glycosylation in order to enhance stability and activity are disclosed, as are methods of making such cystatins and methods of using such cystatins to inhibit proteolysis.

FIELD OF THE INVENTION

[0001] The present invention relates to protease inhibitors, specifically to cystatins, that have been modified by glycosylation in order to enhance stability and activity; to methods of making such modified protease inhibitors and to methods of using such modified protease inhibitors to inhibit proteolysis of a protein substrate.

BACKGROUND

[0002] Proteases are enzymes that degrade proteins. Proteases are classified by the substrate upon which they act and include serine proteases, cysteine proteases, aspartate proteases and metalloproteinases. Serine and cysteine proteases are widespread and are found in diverse organisms including eukaryotic and prokaryotic animals and plants. Cysteine proteases are generally well characterized enzymes having a known primary structure composed of alpha helices and beta pleated sheets (8).

[0003] Proteases mediate many processes that are harmful to man, either by producing pathology or by causing economic loss, for instance by degrading foods. Protease-mediated pathology is known to be caused by a wide variety of organisms including bacteria, such as staphylococci and streptococci, fungi, arthropods, nematodes, protozoa such as amoebas, intestinal flagellates, haemoflagelates, such as Leishmania and trypanosomes and helminths.

[0004] Proteases are known to be important in the pathology of certain viruses (9, 11, 12, 31) including Polio virus, Herpes virus, Corona virus, HIV and Rotavirus. Proteases are also known to play a role in various diseases with no clear etiological agent, such as muscular dystrophy (7) and cancers (1) including breast cancer (2), and amyloid angiopathy, a genetic disease that often leads to fatal cerebral hemorrhages in young adults (10, 13).

[0005] Proteases are also responsible for the spoilage of economically important foodstuffs, necessitating huge annual expenditures on preventative measures. For instance, the fungus Botrytis cinera causes widespread disease in over thirty species of commercial crop plants. Molds and fungi are the major destroyers of citrus fruit crops. Foods rich in protein, such as meat and fish, are degraded and made inedible by proteases from Pseudomonads and other bacteria. Foods with a high muscle content may be quickly broken down by endogenous proteases released from tissues upon death. These enzymes degrade myosin, and destroy the texture of the food. An important example of such spoilage occurs during the processing of surimi, which is a form of processed minced fish, commonly made from Pacific Whiting (Merluccius productus), and is the main ingredient of seafood analogs such as “imitation crab meat”. Surimi is an important source of relatively cheap, high quality, low fat protein important to the diet of many people in the Far East and of increasing economic significance worldwide. Endogenous proteolytic enzymes released during surimi production cause rapid degradation of muscle tissue and lead to poor quality surimi (3, 4, 5, 6, 11, 17, 18, 19). It is thought that the protease released from the fish tissue is a cathepsin, which is a common cysteine protease (3).

[0006] Because of the role of proteases in these various processes, protease inhibitors have been investigated for their potential role in preventing disease and degradation of foodstuffs (4, 5, 18). Partially refined substances that contain protease inhibitors are commonly used in food processing, for instance, to prevent proteolytic breakdown of fish protein during the production of surimi (4, 17, 18). The most commonly used food-grade protease inhibitors are beef plasma protein (BPP), egg white powder and potato powder (4). Genetic engineering techniques have been used to introduce protease-inhibitor genes from chickens into cereal and grass plants to control protease-producing plant pathogens. Likewise, a plant protease inhibitor gene, from Cowpea, has been recombinantly introduced into tobacco, tomato, cotton and other plants to inhibit destruction by nematode worms (16).

[0007] Cystatins are cysteine protease inhibitors that are members of Family 2 of the cystatin superfamily, characterized by a single chain of about 115 to 122 amino acids with a molecular weight of about 13000, having two disulfide bonds (7, 8, 10). Cystatins and protease enzymes such as cathepsins form tight (but reversible) enzyme-inhibitor complexes with dissociation constants typically in the nannomolar range (10).

[0008] A number of cystatins have been characterized including human cystatins (C, S, SN, SA, D, M and E), mouse cystatin, egg-white cystatin, bovine cystatin, carp cystatin, trout cystatin and salmon cystatin. In their natural state, cystatins protect the body by inhibiting the potentially harmful effects of proteolysis, and may prevent destruction of connective tissue by protease enzymes, for instance, lysosomal proteases, released from dying or damaged cells (16).

[0009] Cystatins have been investigated for potential medical applications, for instance, to inhibit replication and pathology of Picornaviruses (12), Coronaviruses (9) and Herpes Simplex type 1 virus (15, 16, 30). Cystatins may also play a natural role in prevention of bacterial infection by E. coli, Shigella (13), Leishmania, Schistosoma and Entamobea (10) which appear to use proteases to facilitate tissue invasion.

[0010] Cystatins are likely the primary protease inhibitors in food-grade protease inhibitor preparations such as beef plasma protein and egg white powder. Since cystatins are themselves proteins, they are prone to denaturation and loss of activity when exposed to unfavorable temperatures or pH. Many food production processes, including surimi production, involve elevated temperatures (17). Presently, in order to maintain cystatin activity, more cystatin must be added after cooling. Adding additional cystatin is both labor-intensive and expensive. Also, when cystatins are used for medical treatment, either as a topical or ingested medication, it is preferable for the cystatin-containing composition to be sterile. A common method of sterilization involves treatment with elevated temperatures. A cystatin that could maintain activity despite exposure to elevated temperatures would thus be useful in food processing and in drug formulation.

SUMMARY OF THE DISCLOSURE

[0011] The present invention provides modified, glycosylated, heat-stable cystatins and methods of making and using these cystatins. The present invention also provides nucleic acid molecules encoding such cystatins.

[0012] The nucleic acid molecules of the invention have been modified so that when such a nucleic acid molecule is expressed in a eukaryotic cell, certain amino-acid residues of the expressed cystatin protein are glycosylated during post-translational modification of the protein. The resulting mature protein has attached, at specific amino acid residues, sugar molecule chains of varying length. The present invention includes the nucleic acid molecules that encode modified cystatins based on the cystatins from humans (C, S, SN, SA, D, M and E), egg white, cow, carp, trout and salmon.

[0013] Various residues in the cystatin primary amino acid sequence have been identified where the introduction glycosylated residues increases heat stability of the expressed protein without severely affecting enzymatic activity. In human cystatin C, for instance, the sites for glycosylation include amino acid residues at positions 35, 36 and 79.

[0014] The present invention also includes a method of making modified heat-stable cystatins by modifying the nucleic acid molecules that encode cystatins. Such nucleic acid molecules are modified at certain defined sites and expressed in a eukaryotic cells. The present invention also includes a cell that contains at least one nucleic acid molecule encoding at least one modified, glycosylated, heat-stable cystatin. The cells of the invention may be of many types, for instance they may be cells from a yeast, a mammal, an insect, or a plant.

[0015] The invention also includes methods of inhibiting proteolysis of a protein substrate by contacting the protein substrate with a modified heat-stable cystatin having at least one engineered glycosylation site. Such a method may be applied, for example, to food processing, such as the production of surimi.

[0016] The invention also includes a method of treating a protease-mediated pathology of an organism, such as a mammal, a fish or a plant by administering to the organism a modified heat-stable cystatin of the invention. By such administration, the modified heat-stable cystatin contacts the protease that mediates the pathology, thereby inhibiting proteolysis by the protease and thereby treating the pathology.

SEQUENCE LISTING

[0017] SEQ. I.D. No. 1 shows the cDNA sequence and the amino acid sequence of native human cystatin C.

[0018] SEQ. I.D. No.2 shows the amino acid sequence of native human cystatin C.

[0019] SEQ. I.D. No.3 shows the cDNA sequence and the amino acid sequence of native human cystatin S.

[0020] SEQ. I.D. No.4 shows the amino acid sequence of native human cystatin S.

[0021] SEQ. I.D. No.5 shows the cDNA sequence and the amino acid sequence of native human cystatin SN.

[0022] SEQ. I.D. No.6 shows the amino acid sequence of native human cystatin SN.

[0023] SEQ. I.D. No.7 shows the cDNA sequence and the amino acid sequence of native human cystatin SA.

[0024] SEQ. I.D. No. 8 shows the amino acid sequence of native human cystatin SA.

[0025] SEQ. I.D. No. 9 shows the cDNA sequence and the amino acid sequence of native human cystatin D.

[0026] SEQ. I.D. No.10 shows the amino acid sequence of native human cystatin D.

[0027] SEQ. I.D. No. 11 shows the cDNA sequence and the amino acid sequence of native human cystatin M.

[0028] SEQ. I.D. No. 12 shows the amino acid sequence of native human cystatin M.

[0029] SEQ. I.D. No. 13 shows the cDNA sequence and the amino acid sequence of native human cystatin E.

[0030] SEQ. I.D. No. 14 shows the amino acid sequence of native human cystatin E.

[0031] SEQ. I.D. No. 15 shows the cDNA sequence and the amino acid sequence of native egg white cystatin.

[0032] SEQ. I.D. No. 16 shows the amino acid sequence of native egg white cystatin.

[0033] SEQ. I.D. No.17 shows the cDNA sequence and the amino acid sequence of native carp cystatin.

[0034] SEQ. I. D. No. 18 shows the amino acid sequence of native carp cystatin.

[0035] SEQ. I.D. No. 19 shows the cDNA sequence and the amino acid sequence of native salmon cystatin.

[0036] SEQ. I.D. No. 20 shows the amino acid sequence of native salmon cystatin.

[0037] SEQ. I.D. No. 21 shows the cDNA sequence and the amino acid sequence of native trout cystatin.

[0038] SEQ. I.D. No. 22 shows the amino acid sequence of native trout cystatin.

[0039] SEQ. I.D. No.23 shows the cDNA sequence and the amino acid sequence of native bovine cystatin.

[0040] SEQ. I.D. No. 24 shows the amino acid sequence of native bovine cystatin.

[0041] SEQ. I.D. No. 25 shows the first of four oligonucleotides used to create a nucleotide coding for modified human cystatin C.

[0042] SEQ. I.D. No. 26 shows the second of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.

[0043] SEQ. I.D. No. 27 shows the third of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.

[0044] SEQ. I.D. No. 28 shows the fourth of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C.

[0045] SEQ. I.D. No. 29 shows a forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C.

[0046] SEQ. I.D. No. 30 is a reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 35 of a modified human cystatin C.

[0047] SEQ. I.D. No. 31 is the forward primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C.

[0048] SEQ. I.D. No. 32 is the reverse primer used in site-directed mutagenesis to introduce a glycosylation site at residue 36 of a modified human cystatin C.

BRIEF DESCRIPTION OF THE DRAWINGS

[0049]FIG. 1 shows the native amino acid sequence of the mature human cystatin C peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0050]FIG. 2 shows the native amino acid sequence of the mature human cystatin S peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0051]FIG. 3 shows the native amino acid sequence of the mature human cystatin SN peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0052]FIG. 4 shows the native amino acid sequence of the mature human cystatin SA peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0053]FIG. 5 shows the native amino acid sequence of the mature human cystatin D peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0054]FIG. 6 shows the native amino acid sequence of the mature human cystatin M peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0055]FIG. 7 shows the native amino acid sequence of the mature human cystatin E peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0056]FIG. 8 shows the native amino acid sequence of the mature Egg White cystatin peptide without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0057]FIG. 9 shows the native amino acid sequence of the mature bovine cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0058]FIG. 10 shows the native amino acid sequence of the mature carp cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0059]FIG. 11 shows the native amino acid sequence of the mature trout cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

[0060]FIG. 12 shows the native amino acid sequence of the mature chum salmon cystatin without its signal sequence. Amino acid modifications for introducing glycosylation sites are shown below the native sequence.

DESCRIPTION OF THE INVENTION

[0061] A. Definitions

[0062] A protein is said to be modified when it has been intentionally, artificially altered from its naturally occurring, wild-type form, e.g., when the primary amino acid sequence of a cystatin protein has been intentionally, artificially altered to add a non-native glycosylation site somewhere in the protein. A DNA sequence is said to be modified when it has been intentionally, artificially altered from its naturally occurring, wild-type form, e.g., when the nucleotide coding for a cystatin protein has been mutated by site-directed mutagenesis to create a non-native nucleotide addition, deletion of substitution. The term engineered may be used synonymously with the term modified.

[0063] To say that an organism, or nucleotide sequence has been genetically engineered means that it has been intentionally, artificially genetically altered from its naturally occurring, wild-type genetic form, for instance, a DNA sequence is said to be genetically engineered when its sequence has been intentionally, artificially genetically altered from its wild-type sequence. Any genetically engineered organism or nucleotide sequence has been modified.

[0064] Heat stability refers to the ability of a protein to function at high temperatures. Proteins typically lose activity as the temperature is raised above their normal in vivo operating range, and generally become denatured as the temperature increases further. A modified protein may be more heat stable than the unmodified form of the protein, meaning that the modified form of the protein retains greater activity at higher temperatures than the unmodified form of the protein.

[0065] To say a feature has been introduced means that a non-wild-type feature has been intentionally, artificially added, for instance, a glycosylation site may be said to be introduced into a protein if it has been intentionally, artificially added to the protein, also a nucleotide may be said to be introduced into a cell or other nucleotide sequence when it has been intentionally, artificially added into a cell or sequence, likewise, a deletion may be introduced into a DNA sequence. Likewise, amino acid substitutions, additions and deletions may be introduced into a protein.

[0066] A glycosylation site is a place on a molecule at which sugars may be added. The addition of sugars (glycosylation) may occur at introduced sites in modified proteins such as at amino acid residue 37 of the modified human cystatin C.

[0067] A glycosylation site in a protein is denoted by the general formula X (#) Z, where X =the native amino acid that has been removed, and # =the position of that amino acid in the protein, and Z =the substituted amino acid that has been inserted in place of X. For instance, Ala (37) Ser denotes that Alanine at position 37 has been removed and replaced with Serine; likewise, for one-letter code, A (31) N denotes that Alanine at position 31 has been removed and replaced with Asparagine. The numbering system of cystatin proteins in this text reflects that shown in the accompanying figures. While alternative numbering systems are possible, the nomenclature use herein is intended to identify a particular amino acid residue, rather than any residue that happens to be at a given distance from an arbitrary point on an amino acid sequence.

[0068] An organism refers to any organism of any kingdom, phylum, class, order, family, genus or species.

[0069] A pathology refers to a state that is measurably or detectably at variance with normal, healthy, non-disease-state physiology.

[0070] A cancer refers to any neoplastic transformation or any tissue that has undergone neoplasia and includes solid, non-solid, benign and malignant transformed tissue of both plants and animals.

[0071] A protease-mediated pathology is any pathology that is caused in part or in whole by the action of a protease, for instance, the proteolytic invasion of human tissue by a cancer cell, the proteolytic destruction of human tissue by a bacterium or a protozoan (e.g., Pseudomonas or Leishmania) and the tissue destruction of a plant by a fungus (e.g., Phytopthora infestans) are all examples of protease-mediated pathologies.

[0072] To say that a polynucleotide (or a gene or genome) is recombinant means that it has been altered by the addition at some site of non-native nucleic acids or nucleotides, i.e., nucleotides that are not normally found in the particular polynucleotide, or that are not normally found at that site. For instance, a human cystatin gene that has been altered by nucleotide substitution is said to be recombinant. Likewise, a recombinant protein is a protein that is the product of a recombinant polynucleotide and that contains non-native amino acid residues.

[0073] An isolated nucleic acid has been substantially separated or purified away from other nucleic acid sequences in the cell of the organism in which the nucleic acid naturally occurs, i.e., other chromosomal and extrachromosomal DNA and RNA. The term isolated thus encompasses nucleic acids purified by standard nucleic acid purification methods. The term also embraces nucleic acids prepared by recombinant expression in a host cell as well as chemically synthesized nucleic acids.

[0074] Nucleic acid probes and primers may readily be prepared based on the nucleic acid sequences provided by this invention. A probe comprises an isolated nucleic acid attached to a detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. Methods for labeling and guidance in the choice of labels appropriate for various purposes are well known in the field of molecular biology. Primers are short nucleic acids, preferably DNA oligonucleotides 15 nucleotides or more in length, which are annealed to a complementary target DNA strand by nucleic acid hybridization to form a hybrid between the primer and the target DNA strand, then extended along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other nucleic-acid amplification methods known in the art. Probes and primers as used in the present invention preferably comprise at least 15 nucleotides of the nucleic acid sequences that encode a modified cystatin protein. In order to enhance specificity, longer probes and primers may also be employed, such as probes and primers that comprise 20, 30 or 40 consecutive nucleotides of the disclosed nucleic acid sequences. Methods for preparing and using probes and primers are described in a number of reference works, for example Sambrook et al. (1989) (26); Ausubel et al., (1987) (25); Innis et al., (1990) (27). PCR primer pairs can be derived from a known sequence, for example, by using computer programs intended for that purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge, Mass.).

[0075] A first nucleic acid sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is placed in a functional relationship with the second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the promoter affects the transcription or expression of the coding sequence. Generally, operably linked DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same reading frame.

[0076] The term purified peptide does not require absolute purity, rather, it is intended as a relative term. Thus, for example, a purified cystatin preparation is one in which cystatin is enriched compared to the cystatin in its natural environment, i.e., within a cell. A purified preparation of a cystatin is prepared such that the cystatin represents at least 20% of the portion of the protein content of the preparation. Preparations comprising at least 50%, 75 % or at least 90% cystatin (expressed as a percentage of total protein) may be desirable for certain applications.

[0077] B. General Methods

[0078] The present invention utilizes standard laboratory practices for the cloning, manipulation and sequencing of nucleic acids, purification and analysis of proteins and other molecular biological and biochemical techniques, unless otherwise stipulated. Such techniques are explained in detail in standard laboratory manuals such as Sambrook et al., (1989)(25) and Ausubel et al., (1987)(26).

[0079] (1) Production, cloning and expression of modified cystatins

[0080] A modified nucleotide sequence that codes for a cystatin protein having at least one glycosylation site may be produced by making synthetic sequences using a commercial polynucleotide synthesizer. The synthetic nucleotides are designed on the basis of the known nucleotide sequence of the cystatin gene and the known codon usage for the cell type in which the nucleotides are to be expressed. A suitable target site for the introduction of a glycosylation site is identified based on the primary amino acid sequence of the cystatin to be modified. A nucleotide sequence is designed that will code for a peptide that includes such a site. The synthetic nucleotide sequence may then be synthesized as described herein. A series of anti-parallel and complementary nucleotides may be synthesized and annealed together to form synthetic double-stranded nucleotide fragment. These fragments may then be ligated together to produce a contiguous double-stranded nucleotide that codes for a particular cystatin having one or more engineered glycosylation sites. The synthetic nucleotides may be amplified in vitro using the Polymerase Chain Reaction (PCR) (27) so that many copies of the nucleotide are available for cloning into a prokaryotic cloning vector (26) or into an expression vector, as explained below.

[0081] Additional glycosylation sites may be introduced using site-directed mutagenesis to add, delete or substitute particular amino acid residues. Various standard techniques are known to carry out site-directed mutagenesis (26, chapter 15), and commercial kits are also available such as the QUICKCHANGE™ mutagenesis site-directed mutagenesis kit (STRATAGENE™, CA). Nucleotides may be modified or synthesized so as to include restriction enzyme sites that can be used for cloning.

[0082] The modified nucleotide may be cloned into a standard prokaryotic cloning vector, for example pBR322, pUC18 or pUC19 (26, chapter 1). The sequence of the cloned nucleotide may be checked by sequencing using standard methods (26, chapters 1 and 13).

[0083] Modified nucleotides may be cloned into an expression vector that allows protein production in a particular cell type. Since the proteins of the invention are glycosylated, it is required that the cell type in which the proteins are expressed can readily carry out post-translational modification including glycosylation. Typically, a eukaryotic cell is used that glycosylates peptides at the Asn-X-Ser/Thr motif. Yeast cells are commonly used for such a purpose. Standard cloning techniques may be used (26, chapter 9). Such expression vector/cell systems are well known and commercially available and include vector/cell combinations that carry out post-translational modifications required for the proper expression of glycosylated eukaryotic proteins. Various yeast strains and yeast-derived vectors are commonly used for this type of expression, for instance, Pichia pastoris expression systems that may be used to practice the present invention may be obtained from INVITROGEN™. Such systems include suitable Pichia pastoris strains, vectors, reagents, transformants, sequencing primers and media. Available strains include a GS115 his 4 deficient strain, a KM71 aoxl deficient strain, a GS115 His⁺ Mut⁻ strain for extracellular expression and a His⁺ Mut⁻ strain for intracellular expression (33).

[0084] Non-yeast eukaryotic vectors may equally be used for expression of the modified nucleotides. Mammalian vector / host cell systems that contain genetic and cellular control elements capable of carrying out transcription, translation and post-translational modification are well known in the art. Examples of such systems are the well known Baculovirus system, the Ecdysone-inducible mammalian expression system that uses regulatory elements from Drosophila melanogaster to allow control of gene expression, and the Sindbis viral expression system that allows high level expression in a variety of mammalian cell lines, which are available from INVITROGEN™.

[0085] The cloned expression vector may then be transformed into a particular cell type and the nucleotide expressed. Many different types of cell may be used to express the modified nucleic acid molecules. Examples of such cells include cells of yeasts, fungi, insects and humans and plants, including transformed and non-transformed cells. For instance, common mammalian cells that could be used for the invention include human HeLa cells, SW-527 human cells (ATCC deposit #7940), WISH cells (ATCC deposit #CCL-25), Daudi cells (ATCC deposit #CCL-213), Mandin-Darby bovine kidney cells (ATCC deposit #CCL-22) and Chinese Hamster ovary cells (ATCC deposit #CRL-2092). Common yeast cells include Pichia pastoris (ATCC deposit #201178) and Saccharomyces cerevisiae (ATCC deposit #46024). Insect cells include cells from Drosophila melanogaster (ATCC deposit #CRL-10191), the cotton bollworm (ATCC deposit #CRL-9281) and from Trichoplusia ni egg cell homoflagelates. Fish cells that may be used include those from rainbow trout (ATCC deposit #CLL-55), salmon (ATCC deposit #CRL-1681) and Zebrafish (ATCC deposit #CRL-2147). Amphibian cells that may be used include those of the Bullfrog, Rana catesbelana (ATCC deposit #CLL-41). Reptile cells that may be used include those from Russell's Viper (ATCC deposit #CCL-140). Plant cells that could be used include Chlamydomonas cells (ATCC deposit #30485), Arabidopsis cells (ATCC deposit #54069) and tomato plant cells (ATCC deposit #54003). Many of these cell types are commonly used and are available from the ATCC as well as from commercial suppliers such as PHARMACIA™ and INVITROGEN™.

[0086] Expressed protein may be accumulated within a cell or may be secreted from the cell. Such expressed protein may then be collected and purified. This protein may then be characterized for activity and heat stability and may be used to practice the methods of the invention.

[0087] The amino acid sequences of the cystatins (FIGS. 1-12) are shown in their mature form without a signal peptide. The signal peptide is cleaved off during post-translational modification to produce the mature peptide. The invention may be equally practiced with cystatin peptides which retain the signal peptide.

[0088] (2) Measurement of protease activity and heat resistance

[0089] Protease inhibition activity of modified cystatins is assayed by measuring the reduction in activity of the protease papain. A substrate is used that releases nitroaniline into the reaction medium. The amount of nitroaniline released is determined by measuring light absorption by the solution. This assay is described in detail in Example 4, below.

[0090] Heat resistance of the modified cystatin is determined by heating a solution containing a known amount of modified cystatin, cooling it, and adding it to a mixture containing a known amount of protease and a known amount of protein. Protease activity is measured by determining turbidity of the mixture after a set time and also by using the nitroaniline assay as described below.

[0091] (3) Methods of using modified cystatins

[0092] The modified cystatins of the invention may be used to inhibit proteolysis of a protein substrate in generally the same way as non-modified cystatins are used. Such uses include inhibition of proteolysis in food processing (4), therapeutic treatment of viral disease such as those caused by Herpes Simplex (11, 30), picornaviruses (20) and coronavirus (9).

[0093] The present invention includes a general method of inhibiting proteolysis of a substrate by providing the modified cystatin and by contacting this cystatin with the substrate. Such a substrate may be any substrate containing protein. Such a modified cystatin may be used in food processing applications, in agricultural applications and in human and non-human medical applications. For instance, a protein substrate such as minced Pacific Whiting (as used for surimi production) may be treated with a modified cystatin of the invention to inhibit proteolysis caused by the release of endogenous protease from the fish tissue. Suggested concentrations for the application of modified cystatin are, for instance, for surimi processing, from about 1 μg/g to 100 μg/g of surimi.

[0094] The present invention also includes methods of using a modified cystatin for the inhibition of proteases for therapeutic purposes. For instance, the cystatins of the invention may be used to inhibit tissue destruction and invasion by pathogens such as staphylococci and streptococci. Streptococcus is the etiological agent of the common skin disease impetigo, and certain particularly rapacious “flesh eating” strains of Staphylococcus and Streptococcus have recently received much media attention, particularly because of their multiple antibiotic resistance.

[0095] The effectiveness of cystatins of the invention in preventing bacterial invasion of host-tissues can be measured, for example, by the method of Betts and Finlay (32). This method can be used to determine tissue invasion of green monkey kidney cells (ATCC deposit #CCL-70) by bacteria such as E. coli and Salmonella typhimurium.

[0096] For medical applications the modified cystatin may be administered topically or systemically. The cystatin may be formulated with a carrier or pharmacologically acceptable excipient. For instance, the cystatin may be mixed with a carrier such as a petroleum-based or lanolin-based oil to form a gel, and administered topically at the site of infection, thereby contacting the modified cystatin with the protease and preventing tissue destruction.

[0097] The cystatins of the present invention may also be administered systemically, either orally, intravenously, sub-cutaneously, transdermally or by other methods. For instance, the cystatin of the present invention may be formulated into a tablet or solution form and administered orally. Formulation of drugs into pills and tablets is well known in the art. Systemic administration may be used to treat infections that cannot be treated topically such as cancer and systemic viral infections of humans and animals.

[0098] For topical medical treatment, concentrations of modified cystatin in a cream or ointment may be from about 1 ng to 100 mg per gram total weight of ointment, or may be from about 1 μg to 1 mg total weight, or may be from about 10 μg to 100 μg total weight. For ingested medications, the amount of modified cystatin per Kg mass of a patient may be from about 1 ng to 100 mg, or may be from about 1 μg to 1 mg, or may be from about 10 μg to 100 μg.

[0099] The cystatins of the present invention may be used to inhibit proteolysis caused by pathogens of crop plants and fruits. For instance, the modified cystatins may be applied to the surface of fruit or crops to inhibit proteases produced by plant pathogens such as fungi, for instance, Botrytis. The modified cystatin may, for instance, be sprayed or painted onto crops, either in a pure form or diluted in a carrier liquid such as water.

[0100] The cystatins of the invention show superior heat resistance, maintaining a high degree of activity after exposure to elevated temperatures. These heat resistant cystatins may be particularly useful where sterility is required, such as in medical applications where it is generally desirable for medicines to be uncontaminated with biological organisms. For instance, the cystatins of the present invention may be formulated into topical ointments or pills which may then be packaged and sterilized and administrated therapeutically. Also, food processing, such as surimi production, involves elevated temperatures. The cystatins of the invention are useful in such processes due to their enhanced resistance.

[0101] C. Production of Modified Human Cystatin C

[0102] Glycosylation of one or more amino acid residues at specific sites in the human cystatin C peptide increases heat stability of the cystatin without substantially inhibiting functionality. Glycosylation at other, inappropriate sites, may drastically decrease or destroy the protein's inhibitory function. The invention identifies three amino acid residues of human cystatin C that may be modified to introduce a glycosylation site; one or more of these sites may be modified to produce a cystatin protein having superior properties, such as enhanced heat stability.

[0103] The amino acid sequences for native and modified human cystatin C are shown in FIG. 1. Amino acid substitutions to produce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more of these amino acid substitutions may be present in a modified cystatin protein of the present invention. The cDNA sequence for native human cystatin C is shown in Seq. ID No. 1.

[0104] Human cystatin C protein may be modified to introduce glycosylation sites (Asn-X-Ser/Thr) at amino acid residues 35, 36 or 79 by the introduction of amino acids substitutions at positions 37, 36 or 81, respectively. One or more of the following specific substitutions may be made: A (37) S, A (37) T, K (36) N, D (81) S, D (81) T.

[0105] It should be noted that the peptide of cystatin C in FIG. 1 is shown in its mature form without a signal peptide. The signal peptide is cleaved off during post-translational modification to produce the mature peptide. The invention may be equally practiced with cystatin peptides which retain the signal peptide.

[0106] The same is so for all the other cystatins, FIGS. 2-12.

[0107] D. Production of Modified Human Cystatin S

[0108] The amino acid sequences for native and modified human cystatin S are shown in FIG. 2. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin S is shown in Seq. ID No. 3.

[0109] The human cystatin S protein may be modified by the introduction of the following amino acid substitutions: A (31) N, K (37) N, A (38) S, A (38) T, L (81) N, D (82) T, and D (82) S.

[0110] E. Production of Modified Human Cystatin SN

[0111] The amino acid sequences for native and modified human cystatin SN are shown in FIG. 3. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin SN is shown in Seq. ID No. 5.

[0112] The human cystatin SN protein may be modified to introduce one or more glycosylation sites by the introduction of one or more of the following amino acid substitutions: A (31) N, A (38) S, A (38) T, K (37) N, L (81) N, D (82) S, D (82) T.

[0113] F. Production of Modified Human Cystatin SA

[0114] The amino acid sequences for native and modified human cystatin SA are shown in FIG. 4. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin SA is shown in Seq. ID No. 7.

[0115] The human cystatin SA protein may be modified by the introduction of the following amino acid substitutions: V (31) N, A (38) S, A (38) T, K (37) N, D (82) S, D (82) T, L (81) N.

[0116] G. Production of Modified Human Cystatin D

[0117] The amino acid sequences for native and modified human cystatin D are shown in FIG. 5. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin D is shown in Seq. ID No. 9.

[0118] The human cystatin D protein may be modified by the introduction of the following amino acid substitutions: A (31) N, V (38) S, V (38) T, D (42) S, D (42) T, D (83) S, D (83) T, P (86) S, P (86) T, Q (90) S, Q (90) T, Y (44) N.

[0119] H. Production of Modified Human Cystatin M

[0120] The amino acid sequences for native and modified human cystatin M are shown in FIG. 6. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin M is shown in Seq. ID No. 11.

[0121] The human cystatin M protein may be modified by the introduction of the following amino acid substitutions: V (35) N, M (40) N, G (41) S, G (41) T, S (42) N, 1 (45) S, 1 (45) T, R (78) N, R (81) N, D (88) N, L (89) N.

[0122] I. Production of Modified Human Cystatin E

[0123] The amino acid sequences for native and modified human cystatin E are shown in FIG. 7. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native human cystatin E is shown in Seq. ID No. 13.

[0124] The human cystatin E protein may be modified by the introduction of the following amino acid substitutions: V (28) N, M (33) N, G (34) S, G (34) T, S (35) N, I (38) S, 1 (38) T, D (81) N, L (82) N.

[0125] J. Production of Modified Egg White Cystatin

[0126] The amino acid sequences for native and modified egg white cystatin are shown in FIG. 8. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native egg white cystatin is shown in Seq. ID No. 15.

[0127] The egg white cystatin protein may be modified by the introduction of the following amino acid substitutions: A (35) S, A (35) T, R (34) N, K (39) S, K (39) T, K (39) N, Y (40) N, L (78) N, K (91) N, Y (92) N.

[0128] K. Production of Modified Bovine Cystatin

[0129] The amino acid sequences for native and modified bovine cystatin are shown in FIG. 9. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native bovine cystatin is shown in Seq. ID No. 23.

[0130] The bovine cystatin protein may be modified by the introduction of the following amino acid substitutions: A (29) N, R (36) S, R (36) T, K (35) N, A (40) S, A (40) T, Y (41) N, L (79) N, D (80) S, D (80) T, P (88) S, P (88) T.

[0131] L. Production of Modified Carp Cystatin

[0132] The amino acid sequences for native and modified carp cystatin are shown in FIG. 10. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native carp cystatin is shown in Seq. ID No. 17.

[0133] The carp cystatin protein may be modified by the introduction of the following amino acid substitutions: Q (31) S, Q (31) T, G (30) N, A (35) S, A (35) T, K (39) N, K (91) N.

[0134] M. Production of Modified Trout Cystatin

[0135] The amino acid sequences for native and modified trout cystatin are shown in FIG. 11. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The cDNA sequence for native trout cystatin is shown in Seq. ID No. 21.

[0136] The trout cystatin protein was modified by the introduction of the following amino acid substitutions: K (29) N, K (30) S, K (30) T, M (34) T, M (34) S, K(39)N, K (88) N.

[0137] N. Production of Modified Chum Salmon Cystatin

[0138] The amino acid sequences for native and modified chum salmon cystatin are shown in FIG. 12. Amino acid substitutions to introduce glycosylation sites are shown directly beneath the amino acids of the native protein. One or more such amino acid substitutions may be present in the protein of the present invention. The CDNA sequence for native chum salmon cystatin is shown in Seq. ID No. 19.

[0139] The chum salmon cystatin protein may be modified by the introduction of the following amino acid substitutions: K (29) N, K (30) S, K (30) T, M (34) S, M (34) T, K (88) N.

EXAMPLE 1 Synthesis of nucleotide sequences that code for modified human cystatin C

[0140] Cloning was done using the p UC19 cloning vector and E. coli using standard gene cloning techniques (26). The yeast strain Pichia pastons KM71 was used to express mammalian genes and constructs. T4 DNA ligase, restriction enzymes, the 7-DEAZA sequencing kit and blunting kit were all purchased from TAKARA SHUZO™ of Kyoto, Japan. The oligonucleotide in vitro mutagenesis system (version 2) was purchased from AMERSHAM™ International. CM TOYOPEARL™ 650M resin was purchased from TOSOH™ of Tokyo. Concanavalin A-sepharose and a-methylmannoside were purchased from PHARMACIA™ and from WAKO™ of Tokyo, respectively. Sephadex-G50 was purchased from PHARMACIA™. M13mpl9 was used as a vector for CDNA construction.

[0141] The modified nucleotide sequences were made in two steps.

[0142] First, a synthetic double-stranded DNA was constructed that codes for human cystatin C, modified to have a glycosylation site at residue number 79. This DNA was made by chemically synthesizing four oligonucleotides using an automated oligonucleotide synthesizer (SEQ. I.D. No. 25, SEQ. I.D. No. 26, SEQ. I.D. No. 27 and SEQ. I.D. No. 28). The oligonucleotides were chosen on the basis of the known nucleotide sequence of the native human cystatin C gene and the known codon usage of P. pastoris (Table 1).

[0143] DNA was sequenced using the Sanger method (26, chapter 13). Polypeptides were sequenced using the Edman degradation method in a gas-phase protein automated sequencer (SHIMADZU™ model PSQ2). TABLE I Amino acid Codon usage Glycine GGT or GGA Glutamic acid GAG of GAA Aspartic acid GAG or GAT Valine GTT or GTG Alanine GCT or GGG Arginine AGA or GGT Serine TCT or TCC Lysine AAG Asparagine AAC Methionine ATG Isoleucine ATT or ATC Threonine ACT or ACC Tryptophan TGG Cysteine TGT Tyrosine TAG Leucine TTG or CTG Phenylalanine TTC Glutamine CAA or CAG Histidine CAC or CAT Proline CCA or CCT

[0144] The complementary pairs of the four oligonucleotides were annealed together and the resulting double-stranded fragments were ligated using T4 ligase. The resulting synthetic open reading frame contained an Xhol site at the 5′ end and an Xba site at the 3′ end that were used for cloning. The gene product was ligated into pUCl9 (26, chapter 1) and sequenced in both directions to check that the sequence was as predicted.

[0145] Second, an N-glycosylation site was introduced at either residue 35 or 36 using the QUICKCHANGE™ site-directed mutagenesis kit (STRATAGENE™, CA) according to manufacturers instructions and using the forward and reverse primers shown in SEQ. I.D. Nos. 29-32.

EXAMPLE 2 Transformation and clone selection for production of modified human cystatin C

[0146] The yeast expression plasmids pYG- 100 (20) and pPICZ a-C containing Saccharomyces cerevissiae a factor secretion signal and alcohol oxidase (AOX1) gene promoter were used to express the proteins of the invention. P. pastoris strain KM71 was transformed using the Pichia EASYCOM™ transformation system (INVITROGEN™, CA). Zeocin-resistant transformants were selected from yeast extract peptone dextrose sorbitol medium (YPDS) agar plates containing the zeocin. PCR sequencing was used to confirm insertion of the cystatin C gene in Pichia clones.

EXAMPLE 3 Expression and purification of modified human cystatin C

[0147] The Pichia transformants were incubated in yeast minimal medium (YMM). The Pichia transformants were grown at 30° C. for one day in 5 mL of YMM, and then subcultured at 30° C. for four days in 500 mnl of fresh YMM. 100% methanol was added into the fresh YMM to a final concentration of 0.5% methanol every 24 hours to maintain induction of the cystatin gene.

[0148] Recombinant modified human cystatin C was secreted in the Pichia culture media. The extracellular proteins were collected using an ultrafiltration system with 10,000 MW cut-off (PELLICON™ cassette filter, MILLIPORE™, Bedford, Mass.). The crude proteins thus recovered were applied to a column of Q-SEPHAROSE FAST FLOW™ (PHARMACIA™, Upsala, Sweden) equilibrated with a linear gradient of 0-0.5 M NaCl in 20 mM Tris-HCI buffer (pH7.5). The fraction including the cystatin was determined by the inhibitory activity against papain as described below. The fraction was applied to a column of sephacryl S-100 HR (PHARMACIA™) equilibrated with 0.15 M NaCI-20 mM Tris-HCI buffer (pH7.5). The fraction which showed the inhibitory activity was collected.

EXAMPLE 4 Assay of inhibitory activity and heat resistance of modified human cysain C

[0149] Cystatin activity was assayed by measuring papain inhibitory activity using Na-Benzoyl-DL-Arg-p-Nitroanilide (Bz-Arg-NA) (34). 0.1 ml of cystatin sample and 0.1 ml of papain solution (0.5 mg/ml) were pipetted into 0.1 mL of 50 mM Tris-HCI buffer (pH7.5) containing 100 mM Bz-Arg-NA, 2 mM EDTA, and 5 mM cystein. The solution was incubated for 25 min at 37° C. The reaction was stopped with 0.2 ml of 30% acetic acid, and the nitroaniline liberated by enzymatic activity is quantified by measurement of light absorption of the solution at 410nm. The inhibitory activity was expressed as the amount of enzyme (mg) inhibited by 1 mg of inhibitor (U/mg).

[0150] Heat resistance assays were performed by heating a solution containing a known amount of cystatin to a controlled temperature for a controlled time; cooling the solution; mixing the cooled cystatin solution with a known amount of papain and protein substrate, and measuring turbidity of the mixture. The recombinant cystatins were heated to 95° C. at a rate of 1C./min for 30° C. in 50 mM sodium phosphate buffer (pH 7.5). Protein concentration was 1 mg/ml.

[0151] At preset temperatures, each heated sample was transferred into a cuvette and the turbidity measured at 500 nm. The residual papain-inhibiting activity of the heated samples was also measured as described above. This procedure was repeated in triplicate.

[0152] The above examples are provided by way of illustration only and are in no way intended to limit the scope of the invention. One of skill in the art will see that the invention may be modified in various ways without departing from the spirit or principle of the invention. We claim all such modifications.

References

[0153] 1. Keppler et al., (1993) In Proteases and Cancer Colloquium Queen's Univ., Belfast, 22, 43-49

[0154] 2. Kuopio et al., (1998) Cancer Research 58 (3), 432-436

[0155] 3. Seymour et al., (1994) Journal of Agricultural and Food Chemistry 42, 2421

[0156] 4. Weerasinghe et al., (1996) J. Agric. Food Chem. 44, 2584-2590

[0157] 5. Izquierdo-Pulido et al., (1994) J. Agric Food Chem 42, 616-622

[0158] 6. Yamashita et al., (1991) Nippon Suisan Gakkaishi 57 (10), 1917-1922

[0159] 7. Barrett et al., The Biochemical Journal (1986) Letters 236 (1), 311-312

[0160] 8. Turk et al., (1991) FEBS 285 (2) 213-219

[0161] 9. Collins et al., (1998) Oral Microbiol Immunol. 13 (1), 59-61

[0162] 10. Barrett (1987) TIBS 12, 193-196

[0163] 11. Grubb et al., (1995) U.S. Pat. No. 5,432,264

[0164] 12. Korant et al., (1985) Biochem. and Biophys. Res. Comm. 127 (3), 1072-1076

[0165] 13. Abrahamson et al., (1988) FEBS Letters 236 (1), 14-18

[0166] 14. Nakamura et al., (1993) J. of Biol. Chem. 268 (17), 12706-12712

[0167] 15. Saitoh et al., (1998) Arch Biochem Biophys 352 (2), 199-206

[0168] 16. Atkinson et al., (1996) PCT Patent No. WO 96/116173

[0169] 17. An et al., (1994) Journal of Food Science 59 (2), 277

[0170] 18. Morrissey et al., (1993) Journal of Food Science 58 (5), 1050

[0171] 19. An et al., (1994) Journal of Food Science 59 (5), 1013

[0172] 20. Turk et al., (1990) U.S. Pat. No. 4,902,509

[0173] 12. Yamashita et al., (1990) Nippon Suisan Gakkashi 56 (8) 1271-1277

[0174] 22. Saeki et al., (1995) Journal of Food Science 60 (5),

[0175] 23. Yamashita et al., (1991) Comp. Biochem. Physiol. 100A (3) 749-751

[0176] 24. Nakamura et al., (1993) FEBS 328 (3) 259-262

[0177] 25. Ausubel et al., (1987). Current Protocols in Molecular Biology, ed. Greene Publishing and Wiley-Interscience: New York (with periodic updates)

[0178] 26. Sarnbrook et al., (1989). Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1- 3, ed. Cold Spring Harbor Laboratory Press: Cold Spring Harbor, N.Y.

[0179] 27. Innis et al., (1990). PCR Protocols: A Guide to Methods and Applications, Academic Press: San Diego

[0180] 28. Nakamura et al., (1996) FEBS Letters 383 251-254

[0181] 29. An et al., (1995) J. Agric. Food Chem. 43 327-330

[0182] 30. Bjorck et al., J. Virol. 64 (2) 941-943

[0183] 31. Wyatt, R. G. et al., (1980) Science, 207, 189-191

[0184] 32. Betts et al., (1992) Can. J. Microb. 38 852-857

[0185] 33. Invitrogen Product Catalogue, 1998. Invitrogen, Carlsbad, Calif.

[0186] 34. Barrett et al., (1981) Methods. Enzymol. 80: 771-778

1 32 1 363 DNA Homo sapiens CDS (1)..(363) 1 tcc agt ccc ggc aag ccg ccg cgc ctg gtg gga ggc ccc atg gac gcc 48 Ser Ser Pro Gly Lys Pro Pro Arg Leu Val Gly Gly Pro Met Asp Ala 1 5 10 15 agc gtg gag gag gag ggt gtg cgg cgt gca ctg gac ttt gcc gtc ggc 96 Ser Val Glu Glu Glu Gly Val Arg Arg Ala Leu Asp Phe Ala Val Gly 20 25 30 gag tac aac aaa gcc agc aac gac atg tac cac agc cgc gcg ctg cag 144 Glu Tyr Asn Lys Ala Ser Asn Asp Met Tyr His Ser Arg Ala Leu Gln 35 40 45 gtg gtg cgc gcc cgc aag cag atc gta gct ggg gtg aac tac ttc ttg 192 Val Val Arg Ala Arg Lys Gln Ile Val Ala Gly Val Asn Tyr Phe Leu 50 55 60 gac gtg gag ctg ggc cga acc acg tgt acc aag acc cag ccc aac ttg 240 Asp Val Glu Leu Gly Arg Thr Thr Cys Thr Lys Thr Gln Pro Asn Leu 65 70 75 80 gac aac tgc ccc ttc cat gac cag cca cat ctg aaa agg aaa gca ttc 288 Asp Asn Cys Pro Phe His Asp Gln Pro His Leu Lys Arg Lys Ala Phe 85 90 95 tgc tct ttc cag atc tac gct gtg cct tgg cag ggc aca atg acc ttg 336 Cys Ser Phe Gln Ile Tyr Ala Val Pro Trp Gln Gly Thr Met Thr Leu 100 105 110 tcg aaa tcc acc tgt cag gac gcc tag 363 Ser Lys Ser Thr Cys Gln Asp Ala 115 120 2 120 PRT Homo sapiens 2 Ser Ser Pro Gly Lys Pro Pro Arg Leu Val Gly Gly Pro Met Asp Ala 1 5 10 15 Ser Val Glu Glu Glu Gly Val Arg Arg Ala Leu Asp Phe Ala Val Gly 20 25 30 Glu Tyr Asn Lys Ala Ser Asn Asp Met Tyr His Ser Arg Ala Leu Gln 35 40 45 Val Val Arg Ala Arg Lys Gln Ile Val Ala Gly Val Asn Tyr Phe Leu 50 55 60 Asp Val Glu Leu Gly Arg Thr Thr Cys Thr Lys Thr Gln Pro Asn Leu 65 70 75 80 Asp Asn Cys Pro Phe His Asp Gln Pro His Leu Lys Arg Lys Ala Phe 85 90 95 Cys Ser Phe Gln Ile Tyr Ala Val Pro Trp Gln Gly Thr Met Thr Leu 100 105 110 Ser Lys Ser Thr Cys Gln Asp Ala 115 120 3 366 DNA Homo sapiens CDS (1)..(366) 3 tcg agc tcc aag gag gag aat agg ata atc cca ggt ggc atc tat gat 48 Ser Ser Ser Lys Glu Glu Asn Arg Ile Ile Pro Gly Gly Ile Tyr Asp 1 5 10 15 gca gac ctc aat gat gag tgg gta cag cgt gcc ctt cac ttc gcc atc 96 Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Ala Leu His Phe Ala Ile 20 25 30 agc gag tac aac aag gcc acc gaa gat gag tac tac aga cgc ccg ctg 144 Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Tyr Tyr Arg Arg Pro Leu 35 40 45 cag gtg ctg cga gcc agg gag cag acc ttt ggg ggg gtg aat tac ttc 192 Gln Val Leu Arg Ala Arg Glu Gln Thr Phe Gly Gly Val Asn Tyr Phe 50 55 60 ttc gac gta gag gtg ggc cgc acc ata tgt acc aag tcc cag ccc aac 240 Phe Asp Val Glu Val Gly Arg Thr Ile Cys Thr Lys Ser Gln Pro Asn 65 70 75 80 ttg gac acc tgt gcc ttc cat gaa cag cca gaa ctg cag aag aaa cag 288 Leu Asp Thr Cys Ala Phe His Glu Gln Pro Glu Leu Gln Lys Lys Gln 85 90 95 tta tgc tct ttc gag atc tac gaa gtt ccc tgg gag gac aga atg tcc 336 Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Trp Glu Asp Arg Met Ser 100 105 110 ctg gtg aat tcc agg tgt caa gaa gcc tag 366 Leu Val Asn Ser Arg Cys Gln Glu Ala 115 120 4 121 PRT Homo sapiens 4 Ser Ser Ser Lys Glu Glu Asn Arg Ile Ile Pro Gly Gly Ile Tyr Asp 1 5 10 15 Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Ala Leu His Phe Ala Ile 20 25 30 Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Tyr Tyr Arg Arg Pro Leu 35 40 45 Gln Val Leu Arg Ala Arg Glu Gln Thr Phe Gly Gly Val Asn Tyr Phe 50 55 60 Phe Asp Val Glu Val Gly Arg Thr Ile Cys Thr Lys Ser Gln Pro Asn 65 70 75 80 Leu Asp Thr Cys Ala Phe His Glu Gln Pro Glu Leu Gln Lys Lys Gln 85 90 95 Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Trp Glu Asp Arg Met Ser 100 105 110 Leu Val Asn Ser Arg Cys Gln Glu Ala 115 120 5 366 DNA Homo sapiens CDS (1)..(366) 5 tgg agc ccc aag gag gag gat agg ata atc ccg ggt ggc atc tat aac 48 Trp Ser Pro Lys Glu Glu Asp Arg Ile Ile Pro Gly Gly Ile Tyr Asn 1 5 10 15 gca gac ctc aat gat gag tgg gta cag cgt gcc ctt cac ttc gcc atc 96 Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Ala Leu His Phe Ala Ile 20 25 30 agc gag tat aac aag gcc acc aaa gat gac tac tac aga cgt ccg ctg 144 Ser Glu Tyr Asn Lys Ala Thr Lys Asp Asp Tyr Tyr Arg Arg Pro Leu 35 40 45 cgg gta cta aga gcc agg caa cag acc gtt ggg ggg gtg aat tac ttc 192 Arg Val Leu Arg Ala Arg Gln Gln Thr Val Gly Gly Val Asn Tyr Phe 50 55 60 ttc gac gta gag gtg ggc cga acc ata tgt acc aag tcc cag ccc aac 240 Phe Asp Val Glu Val Gly Arg Thr Ile Cys Thr Lys Ser Gln Pro Asn 65 70 75 80 ttg gac acc tgt gcc ttc cat gaa cag cca gaa ctg cag aag aaa cag 288 Leu Asp Thr Cys Ala Phe His Glu Gln Pro Glu Leu Gln Lys Lys Gln 85 90 95 ttg tgc tct ttc gag atc tac gaa gtt ccc tgg gag aac aga agg tcc 336 Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Trp Glu Asn Arg Arg Ser 100 105 110 ctg gtg aaa tcc agg tgt caa gaa tcc tag 366 Leu Val Lys Ser Arg Cys Gln Glu Ser 115 120 6 121 PRT Homo sapiens 6 Trp Ser Pro Lys Glu Glu Asp Arg Ile Ile Pro Gly Gly Ile Tyr Asn 1 5 10 15 Ala Asp Leu Asn Asp Glu Trp Val Gln Arg Ala Leu His Phe Ala Ile 20 25 30 Ser Glu Tyr Asn Lys Ala Thr Lys Asp Asp Tyr Tyr Arg Arg Pro Leu 35 40 45 Arg Val Leu Arg Ala Arg Gln Gln Thr Val Gly Gly Val Asn Tyr Phe 50 55 60 Phe Asp Val Glu Val Gly Arg Thr Ile Cys Thr Lys Ser Gln Pro Asn 65 70 75 80 Leu Asp Thr Cys Ala Phe His Glu Gln Pro Glu Leu Gln Lys Lys Gln 85 90 95 Leu Cys Ser Phe Glu Ile Tyr Glu Val Pro Trp Glu Asn Arg Arg Ser 100 105 110 Leu Val Lys Ser Arg Cys Gln Glu Ser 115 120 7 366 DNA Homo sapiens CDS (1)..(366) 7 tgg agc ccc cag gag gag gac agg ata atc gag ggt ggc atc tat gat 48 Trp Ser Pro Gln Glu Glu Asp Arg Ile Ile Glu Gly Gly Ile Tyr Asp 1 5 10 15 gca gac ctc aat gat gag cgg gta cag cgt gcc ctt cac ttt gtc atc 96 Ala Asp Leu Asn Asp Glu Arg Val Gln Arg Ala Leu His Phe Val Ile 20 25 30 agc gag tat aac aag gcc act gaa gat gag tac tac aga cgc ctg ctg 144 Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Tyr Tyr Arg Arg Leu Leu 35 40 45 cgg gtg cta cga gcc agg gag cag atc gtg ggc ggg gtg aat tac ttc 192 Arg Val Leu Arg Ala Arg Glu Gln Ile Val Gly Gly Val Asn Tyr Phe 50 55 60 ttc gac ata gag gtg ggc cga acc ata tgt acc aag tcc cag ccc aac 240 Phe Asp Ile Glu Val Gly Arg Thr Ile Cys Thr Lys Ser Gln Pro Asn 65 70 75 80 ttg gac acc tgt gcc ttc cat gaa cag cca gaa ctg cag aag aaa cag 288 Leu Asp Thr Cys Ala Phe His Glu Gln Pro Glu Leu Gln Lys Lys Gln 85 90 95 ttg tgc tct ttc cag atc tac gaa gtt ccc tgg gag gac aga atg tcc 336 Leu Cys Ser Phe Gln Ile Tyr Glu Val Pro Trp Glu Asp Arg Met Ser 100 105 110 ctg gtg aat tcc agg tgt caa gaa gcc tag 366 Leu Val Asn Ser Arg Cys Gln Glu Ala 115 120 8 121 PRT Homo sapiens 8 Trp Ser Pro Gln Glu Glu Asp Arg Ile Ile Glu Gly Gly Ile Tyr Asp 1 5 10 15 Ala Asp Leu Asn Asp Glu Arg Val Gln Arg Ala Leu His Phe Val Ile 20 25 30 Ser Glu Tyr Asn Lys Ala Thr Glu Asp Glu Tyr Tyr Arg Arg Leu Leu 35 40 45 Arg Val Leu Arg Ala Arg Glu Gln Ile Val Gly Gly Val Asn Tyr Phe 50 55 60 Phe Asp Ile Glu Val Gly Arg Thr Ile Cys Thr Lys Ser Gln Pro Asn 65 70 75 80 Leu Asp Thr Cys Ala Phe His Glu Gln Pro Glu Leu Gln Lys Lys Gln 85 90 95 Leu Cys Ser Phe Gln Ile Tyr Glu Val Pro Trp Glu Asp Arg Met Ser 100 105 110 Leu Val Asn Ser Arg Cys Gln Glu Ala 115 120 9 369 DNA Homo sapiens CDS (1)..(369) 9 ggg agt gcc tcg gcc caa tct agg acc ttg gca ggt ggc atc cat gcc 48 Gly Ser Ala Ser Ala Gln Ser Arg Thr Leu Ala Gly Gly Ile His Ala 1 5 10 15 aca gac ctc aat gac aag agt gtg cag cgt gcc ctg gac ttt gcc atc 96 Thr Asp Leu Asn Asp Lys Ser Val Gln Arg Ala Leu Asp Phe Ala Ile 20 25 30 agc gag tac aac aag gtc att aat aag gat gag tac tac agc cgc cct 144 Ser Glu Tyr Asn Lys Val Ile Asn Lys Asp Glu Tyr Tyr Ser Arg Pro 35 40 45 ctg cag gtg atg gct gcc tac cag cag atc gtg ggt ggg gtg aac tac 192 Leu Gln Val Met Ala Ala Tyr Gln Gln Ile Val Gly Gly Val Asn Tyr 50 55 60 tac ttc aat gtg aag ttc ggt cga acc aca tgc acc aag tcc cag ccc 240 Tyr Phe Asn Val Lys Phe Gly Arg Thr Thr Cys Thr Lys Ser Gln Pro 65 70 75 80 aac ttg gac aac tgt ccc ttc aat gac cag cca aaa ctg aaa gag gaa 288 Asn Leu Asp Asn Cys Pro Phe Asn Asp Gln Pro Lys Leu Lys Glu Glu 85 90 95 gag ttc tgc tct ttc cag atc aat gaa gtt ccc tgg gag gat aaa att 336 Glu Phe Cys Ser Phe Gln Ile Asn Glu Val Pro Trp Glu Asp Lys Ile 100 105 110 tcc att ctg aac tac aag tgc cgg aaa gtc tag 369 Ser Ile Leu Asn Tyr Lys Cys Arg Lys Val 115 120 10 122 PRT Homo sapiens 10 Gly Ser Ala Ser Ala Gln Ser Arg Thr Leu Ala Gly Gly Ile His Ala 1 5 10 15 Thr Asp Leu Asn Asp Lys Ser Val Gln Arg Ala Leu Asp Phe Ala Ile 20 25 30 Ser Glu Tyr Asn Lys Val Ile Asn Lys Asp Glu Tyr Tyr Ser Arg Pro 35 40 45 Leu Gln Val Met Ala Ala Tyr Gln Gln Ile Val Gly Gly Val Asn Tyr 50 55 60 Tyr Phe Asn Val Lys Phe Gly Arg Thr Thr Cys Thr Lys Ser Gln Pro 65 70 75 80 Asn Leu Asp Asn Cys Pro Phe Asn Asp Gln Pro Lys Leu Lys Glu Glu 85 90 95 Glu Phe Cys Ser Phe Gln Ile Asn Glu Val Pro Trp Glu Asp Lys Ile 100 105 110 Ser Ile Leu Asn Tyr Lys Cys Arg Lys Val 115 120 11 387 DNA Homo sapiens CDS (1)..(387) 11 ctg cca cgc gat gcc cgg gcc cgg ccg cag gag cgc atg gtc gga gaa 48 Leu Pro Arg Asp Ala Arg Ala Arg Pro Gln Glu Arg Met Val Gly Glu 1 5 10 15 ctc cgg gac ctg tcg ccc gac gac ccg cag gtg cag aag gcg gcg cag 96 Leu Arg Asp Leu Ser Pro Asp Asp Pro Gln Val Gln Lys Ala Ala Gln 20 25 30 gcg gcc gtg gcc agc tac aac atg ggc agc aac agc atc tac tac ttc 144 Ala Ala Val Ala Ser Tyr Asn Met Gly Ser Asn Ser Ile Tyr Tyr Phe 35 40 45 cga gac acg cac atc atc aag gcg cag agc cag ctg gtg gcc ggc atc 192 Arg Asp Thr His Ile Ile Lys Ala Gln Ser Gln Leu Val Ala Gly Ile 50 55 60 aag tac ttc ctg acg atg gag atg ggg agc aca gac tgc cgc aag acc 240 Lys Tyr Phe Leu Thr Met Glu Met Gly Ser Thr Asp Cys Arg Lys Thr 65 70 75 80 agg gtc act gga gac cac gtc gac ctc acc act tgc ccc ctg gca gca 288 Arg Val Thr Gly Asp His Val Asp Leu Thr Thr Cys Pro Leu Ala Ala 85 90 95 ggg gcg cag cag gag aag ctg cgc tgt gac ttt gag gtc ctt gtg gtt 336 Gly Ala Gln Gln Glu Lys Leu Arg Cys Asp Phe Glu Val Leu Val Val 100 105 110 ccc tgg cag aac tcc tct cag ctc cta aag cac aac tgt gtg cag atg 384 Pro Trp Gln Asn Ser Ser Gln Leu Leu Lys His Asn Cys Val Gln Met 115 120 125 tga 387 12 128 PRT Homo sapiens 12 Leu Pro Arg Asp Ala Arg Ala Arg Pro Gln Glu Arg Met Val Gly Glu 1 5 10 15 Leu Arg Asp Leu Ser Pro Asp Asp Pro Gln Val Gln Lys Ala Ala Gln 20 25 30 Ala Ala Val Ala Ser Tyr Asn Met Gly Ser Asn Ser Ile Tyr Tyr Phe 35 40 45 Arg Asp Thr His Ile Ile Lys Ala Gln Ser Gln Leu Val Ala Gly Ile 50 55 60 Lys Tyr Phe Leu Thr Met Glu Met Gly Ser Thr Asp Cys Arg Lys Thr 65 70 75 80 Arg Val Thr Gly Asp His Val Asp Leu Thr Thr Cys Pro Leu Ala Ala 85 90 95 Gly Ala Gln Gln Glu Lys Leu Arg Cys Asp Phe Glu Val Leu Val Val 100 105 110 Pro Trp Gln Asn Ser Ser Gln Leu Leu Lys His Asn Cys Val Gln Met 115 120 125 13 366 DNA Homo sapiens CDS (1)..(366) 13 cgg ccg cag gag cgc atg gtc gga gaa ctc cgg gac ctg tcg ccc gac 48 Arg Pro Gln Glu Arg Met Val Gly Glu Leu Arg Asp Leu Ser Pro Asp 1 5 10 15 gac ccg cag gtg cag aag gcg gcg cag gcg gcc gtg gcc agc tac aac 96 Asp Pro Gln Val Gln Lys Ala Ala Gln Ala Ala Val Ala Ser Tyr Asn 20 25 30 atg ggc agc aac agc atc tac tac ttc cga gac acg cac atc atc aag 144 Met Gly Ser Asn Ser Ile Tyr Tyr Phe Arg Asp Thr His Ile Ile Lys 35 40 45 gcg cag agc cag ctg gtg gcc ggc atc aag tac ttc ctg acg atg gag 192 Ala Gln Ser Gln Leu Val Ala Gly Ile Lys Tyr Phe Leu Thr Met Glu 50 55 60 atg ggg agc aca gac tgc cgc aag acc agg gtc act gga gac cac gtc 240 Met Gly Ser Thr Asp Cys Arg Lys Thr Arg Val Thr Gly Asp His Val 65 70 75 80 gac ctc acc act tgc ccc ctg gca gca ggg gcg cag cag gag aag ctg 288 Asp Leu Thr Thr Cys Pro Leu Ala Ala Gly Ala Gln Gln Glu Lys Leu 85 90 95 cgc tgt gac ttt gag gtc ctt gtg gtt ccc tgg cag aac tcc tct cag 336 Arg Cys Asp Phe Glu Val Leu Val Val Pro Trp Gln Asn Ser Ser Gln 100 105 110 ctc cta aag cac aac tgt gtg cag atg tga 366 Leu Leu Lys His Asn Cys Val Gln Met 115 120 14 121 PRT Homo sapiens 14 Arg Pro Gln Glu Arg Met Val Gly Glu Leu Arg Asp Leu Ser Pro Asp 1 5 10 15 Asp Pro Gln Val Gln Lys Ala Ala Gln Ala Ala Val Ala Ser Tyr Asn 20 25 30 Met Gly Ser Asn Ser Ile Tyr Tyr Phe Arg Asp Thr His Ile Ile Lys 35 40 45 Ala Gln Ser Gln Leu Val Ala Gly Ile Lys Tyr Phe Leu Thr Met Glu 50 55 60 Met Gly Ser Thr Asp Cys Arg Lys Thr Arg Val Thr Gly Asp His Val 65 70 75 80 Asp Leu Thr Thr Cys Pro Leu Ala Ala Gly Ala Gln Gln Glu Lys Leu 85 90 95 Arg Cys Asp Phe Glu Val Leu Val Val Pro Trp Gln Asn Ser Ser Gln 100 105 110 Leu Leu Lys His Asn Cys Val Gln Met 115 120 15 351 DNA Gallus sp. CDS (1)..(351) 15 agc gag gac cgc tcc cgg ctc ctg ggg gct cca gtg cct gta gat gag 48 Ser Glu Asp Arg Ser Arg Leu Leu Gly Ala Pro Val Pro Val Asp Glu 1 5 10 15 aac gac gag ggc ttg caa cgg gcc ctg cag ttc gcg atg gcc gag tac 96 Asn Asp Glu Gly Leu Gln Arg Ala Leu Gln Phe Ala Met Ala Glu Tyr 20 25 30 aac agg gcc agc aac gat aag tac tcc agc cgg gtg gtg cgg gtc atc 144 Asn Arg Ala Ser Asn Asp Lys Tyr Ser Ser Arg Val Val Arg Val Ile 35 40 45 agc gcc aag cgg cag ctc gtg tct gga atc aag tac atc ctg cag gtt 192 Ser Ala Lys Arg Gln Leu Val Ser Gly Ile Lys Tyr Ile Leu Gln Val 50 55 60 gag att ggt cgc aca act tgc ccc aag tca tca ggt gat ctc cag agc 240 Glu Ile Gly Arg Thr Thr Cys Pro Lys Ser Ser Gly Asp Leu Gln Ser 65 70 75 80 tgc gaa ttc cac gat gag cca gag atg gct aag tat acc aca tgc acc 288 Cys Glu Phe His Asp Glu Pro Glu Met Ala Lys Tyr Thr Thr Cys Thr 85 90 95 ttt gta gtg tac agt att cct tgg cta aac caa att aaa ctg ctg gaa 336 Phe Val Val Tyr Ser Ile Pro Trp Leu Asn Gln Ile Lys Leu Leu Glu 100 105 110 agc aag tgc cag taa 351 Ser Lys Cys Gln 115 16 116 PRT Gallus sp. 16 Ser Glu Asp Arg Ser Arg Leu Leu Gly Ala Pro Val Pro Val Asp Glu 1 5 10 15 Asn Asp Glu Gly Leu Gln Arg Ala Leu Gln Phe Ala Met Ala Glu Tyr 20 25 30 Asn Arg Ala Ser Asn Asp Lys Tyr Ser Ser Arg Val Val Arg Val Ile 35 40 45 Ser Ala Lys Arg Gln Leu Val Ser Gly Ile Lys Tyr Ile Leu Gln Val 50 55 60 Glu Ile Gly Arg Thr Thr Cys Pro Lys Ser Ser Gly Asp Leu Gln Ser 65 70 75 80 Cys Glu Phe His Asp Glu Pro Glu Met Ala Lys Tyr Thr Thr Cys Thr 85 90 95 Phe Val Val Tyr Ser Ile Pro Trp Leu Asn Gln Ile Lys Leu Leu Glu 100 105 110 Ser Lys Cys Gln 115 17 336 DNA Cyprinus carpio CDS (1)..(336) 17 act ggg att cct gga ggc ctt gta gat gca gac att aac gat aaa gat 48 Thr Gly Ile Pro Gly Gly Leu Val Asp Ala Asp Ile Asn Asp Lys Asp 1 5 10 15 gtt cag aag gcg tta cgc ttc gca gtg gac cat tac aac ggc caa agc 96 Val Gln Lys Ala Leu Arg Phe Ala Val Asp His Tyr Asn Gly Gln Ser 20 25 30 aac gat gcg ttt gtg cgt aaa gtt tcc aaa gta atc aag gtt caa caa 144 Asn Asp Ala Phe Val Arg Lys Val Ser Lys Val Ile Lys Val Gln Gln 35 40 45 caa gtt gcc gct ggc atg aaa tac atc ttc act gtg aag atg gaa gta 192 Gln Val Ala Ala Gly Met Lys Tyr Ile Phe Thr Val Lys Met Glu Val 50 55 60 gcc tcc tgc aaa aag ggt gga gtt aag acc atg tgt gcc gtt ccg aag 240 Ala Ser Cys Lys Lys Gly Gly Val Lys Thr Met Cys Ala Val Pro Lys 65 70 75 80 aat ccc agt att gaa cag gtc att cag tgc aaa ata acg gtc tgg agc 288 Asn Pro Ser Ile Glu Gln Val Ile Gln Cys Lys Ile Thr Val Trp Ser 85 90 95 cag cca tgg tta aac tcc ttg aaa gtc act gaa aac acc tgc atg tag 336 Gln Pro Trp Leu Asn Ser Leu Lys Val Thr Glu Asn Thr Cys Met 100 105 110 18 111 PRT Cyprinus carpio 18 Thr Gly Ile Pro Gly Gly Leu Val Asp Ala Asp Ile Asn Asp Lys Asp 1 5 10 15 Val Gln Lys Ala Leu Arg Phe Ala Val Asp His Tyr Asn Gly Gln Ser 20 25 30 Asn Asp Ala Phe Val Arg Lys Val Ser Lys Val Ile Lys Val Gln Gln 35 40 45 Gln Val Ala Ala Gly Met Lys Tyr Ile Phe Thr Val Lys Met Glu Val 50 55 60 Ala Ser Cys Lys Lys Gly Gly Val Lys Thr Met Cys Ala Val Pro Lys 65 70 75 80 Asn Pro Ser Ile Glu Gln Val Ile Gln Cys Lys Ile Thr Val Trp Ser 85 90 95 Gln Pro Trp Leu Asn Ser Leu Lys Val Thr Glu Asn Thr Cys Met 100 105 110 19 336 DNA Oncorhynchus keta CDS (1)..(336) 19 ggt ttg gtc gga ggc ccc atg gac gca aat atg aac gac caa gga acg 48 Gly Leu Val Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr 1 5 10 15 aga gac gcc ctg cag ttc gcg gtg gtc gaa cac aac aag aaa aca aac 96 Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn 20 25 30 gac atg ttt gtc agg cag gtg gcc aag gtt gtc aat gca cag aaa cag 144 Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln 35 40 45 gtg gta tct ggg atg aag tac atc ttc aca gtg cag atg ggc agg acc 192 Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr 50 55 60 cca tgc agg aag gga ggt gtt gag aag atc tgc tcc gtg cac aaa gac 240 Pro Cys Arg Lys Gly Gly Val Glu Lys Ile Cys Ser Val His Lys Asp 65 70 75 80 ccg cag atg gct gtg ccc tac aag tgc acc ttc gag gtg tgg agc cgc 288 Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg 85 90 95 ccc tgg atg agc gat atc cag atg gtc aag aac cag tgt gaa agt taa 336 Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser 100 105 110 20 111 PRT Oncorhynchus keta 20 Gly Leu Val Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr 1 5 10 15 Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn 20 25 30 Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln 35 40 45 Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr 50 55 60 Pro Cys Arg Lys Gly Gly Val Glu Lys Ile Cys Ser Val His Lys Asp 65 70 75 80 Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg 85 90 95 Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser 100 105 110 21 336 DNA Oncorhynchus mykiss CDS (1)..(336) 21 ggt ttg atc gga ggc ccc atg gac gca aat atg aac gac caa gga acg 48 Gly Leu Ile Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr 1 5 10 15 aga gac gcc ctg cag ttc gcg gtg gtc gaa cac aac aag aaa aca aac 96 Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn 20 25 30 gac atg ttt gtc agg cag gtg gcc aag gtt gtc aat gca cag aag cag 144 Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln 35 40 45 gtg gta tct ggg atg aag tac atc ttc aca gtg cag atg ggc agg acc 192 Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr 50 55 60 cca tgc agg aag gga ggt gtt gag aag gtc tgc tcc gtg cac aag gac 240 Pro Cys Arg Lys Gly Gly Val Glu Lys Val Cys Ser Val His Lys Asp 65 70 75 80 cca cag atg gct gtg ccc tac aag tgc acc ttc gag gtg tgg agc cgc 288 Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg 85 90 95 ccc tgg atg agc gat atc cag atg gtc aag aac cag tgt gaa agt taa 336 Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser 100 105 110 22 111 PRT Oncorhynchus mykiss 22 Gly Leu Ile Gly Gly Pro Met Asp Ala Asn Met Asn Asp Gln Gly Thr 1 5 10 15 Arg Asp Ala Leu Gln Phe Ala Val Val Glu His Asn Lys Lys Thr Asn 20 25 30 Asp Met Phe Val Arg Gln Val Ala Lys Val Val Asn Ala Gln Lys Gln 35 40 45 Val Val Ser Gly Met Lys Tyr Ile Phe Thr Val Gln Met Gly Arg Thr 50 55 60 Pro Cys Arg Lys Gly Gly Val Glu Lys Val Cys Ser Val His Lys Asp 65 70 75 80 Pro Gln Met Ala Val Pro Tyr Lys Cys Thr Phe Glu Val Trp Ser Arg 85 90 95 Pro Trp Met Ser Asp Ile Gln Met Val Lys Asn Gln Cys Glu Ser 100 105 110 23 357 DNA Bos taurus CDS (1)..(357) 23 cag ggc cct agg aag ggt cgc ctg ctg ggc ggc ctg atg gag gcg gac 48 Gln Gly Pro Arg Lys Gly Arg Leu Leu Gly Gly Leu Met Glu Ala Asp 1 5 10 15 gtc aat gag gag ggc gtg cag gag gcg ctg tcc ttt gcg gtc agc gag 96 Val Asn Glu Glu Gly Val Gln Glu Ala Leu Ser Phe Ala Val Ser Glu 20 25 30 ttc aac aag cgg agc aac gac gct tac cag agc cgc gtg gtg cgc gtg 144 Phe Asn Lys Arg Ser Asn Asp Ala Tyr Gln Ser Arg Val Val Arg Val 35 40 45 gtg cgc gcc cgc aag cag gtc gtg tca ggg atg aac tat ttc ttg gac 192 Val Arg Ala Arg Lys Gln Val Val Ser Gly Met Asn Tyr Phe Leu Asp 50 55 60 gtg gag ctt ggc cgg act aca tgt acc aag tcc cag gcc aac ttt gac 240 Val Glu Leu Gly Arg Thr Thr Cys Thr Lys Ser Gln Ala Asn Phe Asp 65 70 75 80 agc tgt ccc ttc cat aac cag ccg cac ctg aag agg gaa aag ctg tgc 288 Ser Cys Pro Phe His Asn Gln Pro His Leu Lys Arg Glu Lys Leu Cys 85 90 95 tcc ttc cag gtt tac gtc gtc cca tgg atg aac acc atc aac ctg gtg 336 Ser Phe Gln Val Tyr Val Val Pro Trp Met Asn Thr Ile Asn Leu Val 100 105 110 aag ttt agc tgc cag gat taa 357 Lys Phe Ser Cys Gln Asp 115 24 118 PRT Bos taurus 24 Gln Gly Pro Arg Lys Gly Arg Leu Leu Gly Gly Leu Met Glu Ala Asp 1 5 10 15 Val Asn Glu Glu Gly Val Gln Glu Ala Leu Ser Phe Ala Val Ser Glu 20 25 30 Phe Asn Lys Arg Ser Asn Asp Ala Tyr Gln Ser Arg Val Val Arg Val 35 40 45 Val Arg Ala Arg Lys Gln Val Val Ser Gly Met Asn Tyr Phe Leu Asp 50 55 60 Val Glu Leu Gly Arg Thr Thr Cys Thr Lys Ser Gln Ala Asn Phe Asp 65 70 75 80 Ser Cys Pro Phe His Asn Gln Pro His Leu Lys Arg Glu Lys Leu Cys 85 90 95 Ser Phe Gln Val Tyr Val Val Pro Trp Met Asn Thr Ile Asn Leu Val 100 105 110 Lys Phe Ser Cys Gln Asp 115 25 115 DNA Artificial Sequence Description of Artificial Sequence first of four oligonucleotides used to create a nucleotide coding for modified human cystatin C 25 gtatctctcg agaaaagatc ttctccaggt aagccaccaa gattggtcgg tggtccaatg 60 gacgcctctg tcgaggagga gggtgtcaga agagccttgg acttcgccgt cggtg 115 26 115 DNA Artificial Sequence Description of Artificial Sequence the second of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C 26 caagaagtag ttgacaccgg cgacaatttg ctttctggct ctgacgactt gcaaggctct 60 ggagtggtac atgtcgttag aggccttgtt gtactcaccg acggcgaagt ccaag 115 27 115 DNA Artificial Sequence Description of Artificial Sequence the third of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C 27 caaattgtcg ccggtgtcaa ctacttcttg gacgttgagt tgggtagaac tacttgtact 60 aagactcaac caaacttgac taactgtcca ttccacgacc aaccacactt gaaga 115 28 115 DNA Artificial Sequence Description of Artificial Sequence the fourth of four oligonucleotides used to create a nucleotide coding for synthetic human cystatin C 28 tgttctagat caggcgtctt gacaagtaga cttagacaaa gtcatagtac cttgccatgg 60 gacggcgtaa atttggaaag aacagaaggc ctttctcttc aagtgtggtt ggtcg 115 29 30 DNA Artificial Sequence Description of Artificial Sequence forward primer used in site-directed mutagenesis to intro. a glycosylation site at residue 35 of a modified human cystatin C 29 ggtgagtaca acaagtcctc taacgacatg 30 30 30 DNA Artificial Sequence Description of Artificial Sequence reverse primer used in site-directed mutagenesis to intro. a glycosylation site at residue 35 of a modified human cystatin C 30 catgtcgtta gaggacttgt tgtactcacc 30 31 30 DNA Artificial Sequence Description of Artificial Sequence forward primer used in site-directed mutagenesis to intro. a glycosylation site at residue 36 of a modified human cystatin C 31 ggtgagtaca acaacgcctc taacgacatg 30 32 30 DNA Artificial Sequence Description of Artificial Sequence reverse primer used in site-directed mutagenesis to intro. a glycosylation site at residue 36 of a modified human cystatin C 32 catgtcgtta gaggcgttgt tgtactcacc 30 

What is claimed:
 1. A modified cystatin selected from the group consisting of: a) modified human cystatin C having at least one of the following modifications: K (36) N A (37) S A (37) T D(81)S D (81) T b) modified human cystatin S having at least one of the following modifications: A (31) N K (37) N A (38) S A (38) T L (81) N D (82) S D (82) T c) modified human cystatin SN having at least one of the following modifications: A (31) N A (38) S A (38) T K (37) N L (81) N D (82) S D (82) T d) modified human cystatin SA having at least one of the modifications selected from the group consisting of: V (31) N A (38) S A (38) T K (37) N D (82) S D (82) T L (81) N e) modified human cystatin D having at least one of the modifications selected from the group consisting of: A (31) N V (38) S V (38) T D (42) S D (42) T Y (44) N D (83) S D (83) T P (86) S P (86) T Q (90) S Q (90) T f) modified human cystatin M having at least one of the modifications selected from the group consisting of: V (35) N M (40) N G (41) S G (41) T S (42) N I (45) S I (45) T R (78) N R (81) N D (88) N L (89) N g) modified human cystatin E having at least one of the modifications selected from the group consisting of: V (28) N M (33) N G (34) S G (34) T S (35) N I (38)S I (38) T D (81) N L (82) N h) modified egg white cystatin having at least one of the modifications selected from the group consisting of: A (35) S A (35) T R (34) N K (39) S K (39) T K (39) N Y (40) N L (78) N K (91) N Y (92) N i) modified bovine cystatin having at least one of the modifications selected from the group consisting of: A (29) N R (36) S R (36) T K (35) N A (40) S A (40) T Y (41) N L (79) N D (80) S D (80) T P (88) T P (88) S j) modified carp cystatin having at least one of the modifications selected from the group consisting of: Q(31)S Q (31) T G (30) N A (35) S A (35) T K (39) N K (91) N k) modified trout cystatin having at least one of the modifications selected from the group consisting of: K (29) N K (30) S K (30) T M (34) T M (34) S K (88) N I) A modified chum salmon cystatin having at least one of the modifications selected from the group consisting of: K (29) N K (30) S K (30) T M (34) S M (34) T K (88) N
 2. A nucleic acid molecule encoding a modified cystatin according to claim
 1. 3. A cell comprising a nucleic acid molecule according to claim
 2. 4. A cell according to claim 3 wherein the cell is selected from the group consisting of: a yeast cell, an insect cell, a human cell, a bacterial cell and a plant cell.
 5. A method of inhibiting proteolysis of a protein substrate by contacting the cystatin protein substrate with the modified cystatin of claim
 1. 6. A modified human cystatin C having at least one of the following modifications: K (36) N A (37) S A (37) T D (81) S D (81) T
 7. A nucleic acid molecule encoding a modified cystatin according to claim
 6. 8. A cell comprising a nucleic acid molecule according to claim
 7. 9. A cell according to claim 8 wherein the cell is selected from the group consisting of: a yeast cell, an insect cell, a human cell, a bacterial cell and a plant cell.
 10. A method of inhibiting proteolysis of a protein substrate by contacting said cystatin protein substrate with a modified cystatin according to claim
 6. 